Preprint typeset in JHBP style - HYPER VERSION 



KEK-TH-1369 
IPMU 10-0091 



Search for the Elusive Higgs Boson Using Jet 
Structure at LHC 



Chuan-Ren Chen 

Institute for the Physics and Mathematics of the Universe, 
The University of Tokyo, Chiba, 277-8583, Japan 

Mihoko M. Nojiri 

Institute for the Physics and Mathematics of the Universe, 
The University of Tokyo, Chiba, 277-8583, Japan 
Theory Group, KEK,1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan 
The Graduate University for Advanced Studies (Sokendai), 
1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan 

Warintorn Sreethawong 

Theory Group, KEK,1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan 
The Graduate University for Advanced Studies (Sokendai), 
1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan 

Abstract: 

We consider the production of a light non-standard model Higgs boson of order 
100 GeV with an associated W boson at CERN Large Hadron Collider. We focus 
on an interesting scenario that, the Higgs boson decays predominately into two light 
scalars x with mass of few GeV which sequently decay into four gluons, i.e. h — 
2x — 4(7. Since x is much lighter than the Higgs boson, it will be highly boosted 
and its decay products, the two gluons, will move close to each other, resulting in 
a single jet for x decay in the detector. By using electromagnetic calorimeter-based 
and jet substructure analyses, we show in two cases of different x masses that it is 
quite promising to extract the signal of Higgs boson out of large QCD background. 
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1. Introduction 

Even though the Standard Model (SM) successfully explains almost all of the data 
in collider experiments, the Higgs boson, which is associated with the electroweak 
symmetry breaking mechanism in the SM, has not yet been found. The current 
limits on the Higgs boson mass, ruh, come from the direct searches at LEP 11 and 
Tevatron, where a lower bound of 114 GeV has been obtained and a window of 
158 GeV < uih < 175 GeV has been excluded, respectively, at 95% confidence level 
(C.L.) m, 0. Combined with the direct search limit, the global fits to electroweak 
precision data prefer a light Higgs boson, nih < 191 GeV at 95% C.L. 0. The 
Large Hadron Collider (LHC) at CERN in-operation is expected to discover the 
SM Higgs boson with mass up to 1 TeV, if such a particle exists. At the LHC, 
the Higgs boson is predominately produced in the gluon-gluon fusion process via a 
top quark loop. Other important production processes are vector-boson fusion and 
production of the Higgs boson with an associated W/Z boson (associated hW/hZ 
production). The search strategy for the Higgs boson depends on its mass. When 
the Higgs boson is heavy (m/j > 135 GeV), the channels of /i — )■ VTll/^*) — > £^£2 z/iz/2 
and h — ZZ — )■ £^£^"£^£2 very promising. For a light Higgs boson {rrih < 135 
GeV), inclusive /i — )■ 77 and h — t- r+r^ in vector-boson fusion production process 
are the main search channels while the dominant decay channel h bb suffers from 
a very large QCD background. 
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Recently, the associated Zh/Wh production combined with h ^ bb search, which 
has been considered less promising, was recovered by focusing on the regime where 
the Higgs boson is highly boosted and employing a jet substructure technique, and 
it was reported that the significance about 4(T can be achieved for 30 fb~^ integrated 
luminosity at the LHC ^. The idea is that when a heavy particle is highly boosted, 
its decay products will be emitted very coUinearly so that they are likely to be merged 
into a single jet rather than to appear as separate jets. Such jet should have internal 
structures corresponding to decay products, including their soft /coUinear radiations, 
and the jet mass should peak around the mass scale of the parent particle. Jet 
substructure has also been applied in searches for a boosted top quark decay |P, ^, 
strongly interacting W bosons and supersymmetric particles 0. 

Furthermore, in spite of the great agreement between the SM predictions and 
experimental results, it is widely believed that the SM is only an effective theory at 
the weak scale due to, for instance, the so-called "hierarchy problem" and lack of 
a dark matter candidate in the SM. Many new models beyond the SM have been 
proposed over past decades, and in most of them the Higgs sector is extended. It 
should be emphasized that the current bound on the Higgs boson mass cannot be 
directly applied to such models especially when properties of the Higgs sector are 
significantly modified. For example, the LEP lower bound is based on studies of the 
associated Zh production with h — )■ bb. Therefore, the limit can be relaxed if the 
production cross section, or equivalently an effective coupling gzzh, is suppressed. 
Another interesting possibility is that the Higgs boson is hidden, i.e. it has non-SM 
decay modes and escapes from detection even though it might have been produced 
at LEP and Tevatron. Particularly, if the decay of the Higgs boson into not-yet- 
discovered light particles dominates, the direct search strategies have to be revised 
to capture the nature of new decay topologies. Examples of scenario are the exten- 
sions of Minimal Supersymmetric Standard Model (MSSM) including an additional 
singlet [|^, [ill or CP- violating interactions [Q, the little Higgs models ||13| and su- 



persymmetric little Higgs models |T^, |I5|. In most of these models, the Higgs boson 
dominantly decays into a pair of new light scalars x which subsequently decay into 
two or more visible SM particles. The phenomenology of such non-SM decays has 
been studied both at the Tevatron [|T^, [T^ and at the LHC |rT], |T^, |rB]. The most 
commonly studied cascade decays are /i -> 2% — > 46/4r/262r. 

In this paper, we study a scenario in which a light Higgs boson (< 135 GeV) 
decays predominately into a pair of very light pseudoscalars, ( < 2mft) each of 
which then decays into two gluons, i.e. /i — )• 2% — )■ Ag. A model which predicts such 



a decay channel will be briefly described later in Section 2.1. However, we stress 



that we do not restrict ourselves to a particular model since we treat the branching 
ratios of h 2x and x ~^ 2(7 as free parameters. This channel will encounter the 
large QCD background, we therefore focus on the associated Wh production process 
where W boson decays leptonically in order to suppress the background. Since the 
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mass splitting between Higgs boson and x particle is large, two pseudoscalars will be 
highly boosted. An opening angle between two gluons from each x decay, which is 
also limited by the small mass of x will be so small that they are closely aligned and 
typically present in a single jet in the detector. The collider signature is essentially 
two central non-6 jets, an isolated charged lepton and missing momentum. We will 
show that the signal for a relatively light x can be feasibly discovered at the LHC by 
imposing cuts on electromagnetic-calorimeter-based (E-cal-based) variables, together 
with some kinematic cuts. For the case of heavier x the angular separation between 
daughter gluons from a pseudoscalar decay is larger, so the E-cal-based analysis 
fails. However, by employing the jet substructure technique previously mentioned, 
we demonstrate that such a scenario still can be discovered. For illustration, we 
adopt nih = 120 GeV, 100% decay branching ratio for /i — )■ 2% and x ~^ 2(7, and two 
masses for x'- f^x ~ ^ GeV (light) and = 8 GeV (heavy). 

The paper is organized as follows. In the next section, we firstly give a brief 
review on the model of elusive Higgs boson. Afterward, signal and background 
generations are described. In Section 3, we present a search strategy using narrow 
jet at the LHC for the case = 4 GeV. Jet substructure method and its application 
to the case m^^ = 8 GeV are presented in Section 4. Lastly, Section 5 is devoted to 
discussions and conclusions. 



2. Higgs Decay h ^ 2x ^ at the LHC 

2.1 A Reference Model 

For models beyond the SM, it is not unusual that a light singlet scalar appears as 
a result in the extended Higgs sector. If there exists a coupling between the Higgs 
boson and the light scalar, the Higgs may predominately decay into a pair of such 
light scalars. The dominant decay channel of the light scalar then depends on its mass 
and couplings to SM fermions. Here we briefly review a supersymmetric extension 



of the simplest Little Higgs model |]14|, in which the light scalar will mainly decay 
into a gluon pair when its mass is lighter than twice the bottom quark mass, which 
is the scenario we focus on in this paper. We only introduce the phenomenology in 
Higgs sector and we refer readers to the original paper for the details of the model. 

The model is a supersymmetric extension of the simplest Little Higgs model [|T^]. 
The gauge group is SU{3)c x SU{3)w x U{l)x, where the SU{3)w x U{l)x is 
broken by two vector-like sets of Higgs superfields, ^u,d and 'Hu,d- The misalignment 
between the vacuum expectation values (VEVs) of these two Higgs fields leads to 
electroweak symmetry breaking. There exists an approximate SU (3)i x SU (3)2 global 
symmetry due to the absence of the cross terms between $ and Ti. The VEV of $ 
at the order of 10 TeV breaks the SU{3)i and the gauge group down to the SM 
SU{3)c X SU{2)w X U{1)y, generating five massive gauge bosons. The VEV of Ti, 
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which is generated radiatively with an order of few hundreds GeV, breaks another 
set of SU{3) X U{1) — )• SU{2) x U{1) and produces five Goldstone bosons, three 
of which are eaten by W and Z bosons after electroweak symmetry breaking, and 
the two remnants are identified as Higgs boson h and hght pseudoscalar x- The 
couphng between h and x is generated due to the fact that h lives partly in the 
third component of T-Lu^d, where x mainly stays. After the Higgs gets a VEV and 
canonically normalizing the Higgs and pseudoscalar fields, the tree-level interaction 



between h and x generated from the Higgs kinetic terms is given as |I4 



c,,,^-^id,xr , ■ (2.1) 

The coupling of the Higgs boson to the SM fermions is the same as that in the SM 
but with an additional factor a/1 — v'^m/'^n- 

The decay widths of the Higgs boson into a pair of x's and the SM fermions / 
are therefore 

r..// - - J) r^"//' (2-3) 



where v^w = 174 GeV is the electroweak scale related to the Higgs VEV, v-h ~ 
300 GeV ~ 500 GeV and T^^^^- is the decay width of the SM Higgs boson into a 
fermion pair. After electroweak symmetry breaking, x acquires mass naturally of 
order few to (9(10) GeV. The coupling between the singlet x to the SM fermion / is 
induced by the mixing of the SM fermion and its heavy partner /', and is suppressed 
by a factor of m'j/m'j,. Therefore, only the couplings of third generation fermions to 
X should be considered. When x is heavier than 2mb, it decays almost 100% into a 
pair of b quarks. Below the threshold of two b quarks, the dominant decay channel of 
X is always X ^ 99 and the subdominant channel (branching ratio < 1%) is x 77 
or X — i- r+r^ depending on the mass of x- For example, with the parameters chosen 
in the Fig. 7 of the Ref 0] the branching ratio of x ^ 11 is larger (smaller) than 
X — > r+r^ when x is heavier (lighter) than about 8 GeV. 

In summary, we review in this section a reference model in which the light Higgs 
boson mainly decays into two singlet light scalars which sequently decay into four 
gluons, i.e. /i — > 2% — 4g. In our numerical study, we take 100% branching ratio for 
both h ^ 2x and x ^ '^9 fo^' = 120 GeV and < 2m6, since we only focus 
on this non-SM Higgs decay scenario and will explore it in a model independent 
approach. For any particular model which predicts such a scenario, our result can 
be easily applied by rescaling the production cross section of the Higgs boson and 
branching ratios. 
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2.2 Signal and Background Generations 



For signal event generation, we employed the Herwig 6.5 Monte Carlo event gener- 
ator [|l9l at the LHC with ^/s = 14 TeV center-of-mass energy. We generated the 
heavier neutral MSSM Higgs boson in association with a W boson and forced the 
Higgs boson to decay into two pseudoscalars each of which subsequently decays into 
two gluons. All backgrounds were generated by Alpgen 2.13 with showering and 
hadronization by Herwig. In our analysis, we particularly forced W ^ eu for both 
signal and background processes, therefore the final statistics should be twice our re- 
sult when W ^ fiu is also considered. The leading order cross sections incorporating 
branching fraction for signal and background processes are listed in Table |^. Note 
that the signal cross section is normalized to the SM value. We present in detail two 
Higgs search strategies for = 4 GeV and = 8 GeV in the next two sections: 
the electromagnetic calorimeter-based method and the jet substructure method, re- 
spectively. For the former analysis, Wjj is the dominant background process. For 
the jet substructure analysis, tt,tW and tbW backgrounds are also important. 

After event generation, we em- 



process 


cross section @ LO (pb) 


hW — 7- eujj 


0.28 


Wjj eujj 


1401.64 


WW eujj 


11.16 


ti — eubbjj 


65.53 


tq — > eubj 


10.61 


tW evbjj 


7.99 


tbW — )■ evbbjj 


12.17 



ployed Fast jet for jet reconstruc- 



Table 1: Production cross sections for signal and 
background processes. 



tion. We used Cambridge-Aachen 
(CA) and KT jet algorithms in this 
study. Moreover, all events are sub- 
jected to the following pre-selection 
cuts: 

(i) an isolated lepton with trans- 
verse momentum pti > 20 GeV, 

(ii) at least two jets with transverse 
momenta Pti > 40 GeV,pT2 > 30 GeV 
and pseudo-rapidities \rii^2\ < 2, 

^/2{EtiE^''''' — pj^i ■ p™*"") < mw (lepton mass is ne- 



(iii) a transverse mass ttit 
glected) where E^^^^ and p™"^*^ are transverse missing energy and momentum respec- 
tively, 

(iv) no tagged 6-jets. 

Note that the energy smearing effects of isolated photons, charged leptons and jets 
are taken into account in the pre-selection cut ||2^. The cut on transverse mass is 
used to reduce backgrounds with an accidental isolated lepton which is not from W 
boson decay. Moreover, we veto &-jet to reduce backgrounds from top quark events 
and assume 60% 6-tagging efficiency. 



3. Narrow Jet and = 4 GeV Case 



Since = 4 GeV is much lighter than the Higgs boson (m^ = 120 GeV), the 
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decay products of x will be very coUinear, generating a very narrow single jet in the 
detector. We therefore resolve the narrow jet structure with the aid of fine granularity 
of electromagnetic calorimeter, which has been well studied for the hadronic decay 



of r lepton [23 



We now introduce variables adapted in the reconstruction technique for the 
hadronically decaying r, which will be used to characterize a jet from x decay. The 
starting point of our analysis is to define jet axes. We use true parton momentum 
axes of X particles for signal events, while take two hardest jet axes from jet-clustering 
algorithm for background events. Then, we define a cone of radius i? = 0.4 around 
each jet axis and calculate the following quantities: 

• The electromagnetic transverse energy {E^) 

The electromagnetic transverse energy is defined as a sum of transverse energy 
deposited in the electromagnetic calorimeter and transverse energy of charged 
particles within a cone radius i?, i.e. 



^ E^^^„ (3.1) 



i?<0.4 

where i runs over all electromagnetic calorimeter cells and charged particles 
within a cone of radius R = 0.4. In this analysis, we take the granularity of 
electromagnetic calorimeter cells Arj x A(f) = 0.025 x 0.025 corresponding to 
the middle layer of ATLAS calorimeter This variable is similar to the jet 
transverse energy, however, cut on this variable is tighter and more effective to 
reduce backgrounds. 

The electromagnetic radius (-Rem) 

The electromagnetic radius (-Rem) is defined as 

p Ei^|-"\/fa-^jet)^ + ('^.-'/>jet7 

where {jji^cpi) and (r7jet,0jet) denote (pseudo-rapidity, azimutual angle) of the 
2th cell and the jet axis, respectively. This variable gives an effective cone 
radius in which a large fraction of energy is deposited. For collimated jets, -Rem 
tends to be narrow and peak at small value. However, it should be noted that 
the efficiency of cut on this variable is -E^- dependent. When Et is increasing, 
the -Rem distribution will be narrower. Therefore, this variable becomes less 
effective when Et is large. 

The energy isolation in the calorimeter {Ei^o) 
The energy isolation is defined as 

-E'iso = ^ E^JE^ (3.3) 

ri<R<r2 
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Figure 1: Non-normalized distributions of electromagnetic calorimeter-based variables, 
i?em and -Eiso {E^ > 30 GeV), for signal with = 4 GeV (upper panels) and for Wjj 
background (lower panels). 



where i runs over all electromagnetic calorimeter cells and charged particles 
inside a ring of ri < R < r2- In our study we take ri = 0.1 and r2 = 0.4. For 
highly coUimated jets, most QCD activities distribute near the axis of core, 
i.e. within R < 0.1, and the i^iso distribution is narrow with a peak at zero. 
Similar to -Rem, performance of ii^iso cut is also i?T- dependent. In addition, this 
variable is expected to be less effective for events with higher hadronic activity 
such as ti events. 

We show in Figure |1] the non-normalized distributions of i?em and Eiso {E^ > 
30 GeV) for m-^ = 4 GeV signal (upper panels) and for the dominant Wjj back- 
ground (lower panels). For comparison, we also show similar distributions for differ- 
ent SM Higgs decay channels: h — )• gg, h ^ bb, h ^ uu, and h — )• r+r~ in Appendix 
A. From Figure |l|, the distributions for signal are clearly narrower than those for 
Wjj background. Almost all signal events has Rem smaller than 0.1, while the dis- 
tribution for Wjj events extends beyond 0.2. Similarly, i^iso for signal is less than 
0.5, but the background distribution has a long tail up to Eis^ ~ 1- This indicates 
that high energy objects inside a jet are more highly coUimated for signal. 



-7- 



2500 F 




dl I I I I I I I I I I I I I I I I I L. 



5 10 15 

ifij ( GeV) 



5 10 15 
mj ( GeV) 



20 



Figure 2: Non- normalized jet mass distributions of the first two hardest jets for signal 
with = 4 GeV (left) and for Wjj background (right) after imposing pre-selection cuts. 



Selection cut 


Wh 

(m^ = 4 GeV) 


Wjj 


WW 


tt 


tq 


tw 


thW 


pre-selection cuts 


811 


382,685 


1,868 


2,460 


412 


644 


400 


nj<A 


772 


345,211 


1,444 


600 


380 


410 


112 


ruj < 8 GeV 


601 


115,214 


620 


288 


118 


112 


52 


i?cm < 0.06 


456 


37,735 


194 


108 


16 


40 


32 


Eiso < 0.15,^^"^ > 30 GeV 


450 


25,646 


125 


124 


14 


32 


24 


£^r(ji,2) > 100,50 GeV 


98 


1,431 


3 


56 


2 


8 





after imposing all cuts above 


29 


17 


















Table 2: Number of expected signal and background events after cuts in the dijet invariant 
mass window 110 GeV < rujj < 130 GeV for £ = 30 fb~^ at the LHC. The analysis is 
based on the KT algorithm. 



In our analysis, we choose the following cuts: 



• i?em < 0.06, 

• < 0.15, 

• ^r(ji) > 100 GeV, £;r(j2) > 50 GeV. 

In addition to electromagnetic calorimeter-based variables introduced above, we 
also employ cuts on another two variables: the number of jets ^ 

and jet mass. The signal events are expected to contain two jets with few addi- 
tional jets from initial/final state radiations. In contrast, tt and tbW backgrounds 

^The transverse momentum and rapidity of jets defined through this paper are: px > W GeV 
and \r]\ < 5. 
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are expected to have high jet multiphcity in the final states and then can be sup- 
pressed by imposing the upper hmit on number of jets. The key difference in jet 
mass distribution between signal and QCD background is the following. For a heavy 
particle decaying hadronically into a single jet, the jet mass distribution will ex- 
hibit an enhancement structure corresponding to the mass scale of the particle. For 
QCD backgrounds, even though original partons are massless, the QCD splitting 
can generate nonzero masses to the jets. In perturbative theory, jet mass appears at 
next-to-leading order (NLO) where two massless partons can be present in a single 



jet. The average value is approximately given by p4| 



m]) ^ C%lR\ (3.4) 



7r 



where C is a coefficient that depends on the type of partons and i? is a parameter 
equivalent to cone radius. At the NLO, there is no difference between two jet re- 
combination algorithms because there is only one way to combine two parton into a 
jet. Beyond the average jet mass, the distribution falls smoothly due to the lack of 
any intrinsic mass scale. In Figure |], we show non-normalized jet mass distributions 
from KT algorithm with R=0.4 for signal (left) and Wjj background (right) after 
imposing pre-selection cuts. The peak of signal distribution is a bit higher than the 
true pseudoscalar mass due to gluon radiation contribution. Although the distribu- 
tion for Wjj background seems to peak near the pseudoscalar mass value, however, 
we should keep in mind that the position of the peak is approximately linear in 
Pt- Imposing high-p^ cut along with the upper bound of jet mass can be useful for 
background reduction. 

In Table 0, we show the number of expected signal and background events in 
a dijet invariant mass window 110 GeV< mjj < 130 GeV for 30 fb"^ integrated 
luminosity at the LHC ^. The first row shows number when only pre-selection cuts 
are applied. Lower entries are numbers when cuts on number of jets, jet mass, and 
electromagnetic calorimeter-based variables are separately imposed. The final row 
shows the number of events when we use all cuts. 

As we can see in Table ^ after imposing pre-selection cuts, Wjj is the dominant 
background, and WW and ti with 6-jets misidentified as leptons are also significant. 
When rij < 4 is imposed, tt and tbW events are reduced by a factor of four. By 
demanding rrij < 8 GeV, the number of backgrounds is reduced by a factor of 
3 ~ 10. Again, it should be emphasized that these numbers are before applying 
high-pT cut on jets. The efficiency of Rem and i^iso cuts are impressive. They remove 
backgrounds by one order of magnitude while keeping more than half of signal events. 
Lastly, since cut is equivalent to px cut and even tighter, considering high-i?^™ 
events hence corresponds to focusing on a boosted Higgs boson regime for signal. 



^Thc momentum smearing is parametrized by Gaussian resolution bOVoy'E p^ . Therefore, the 
dijet invariant mass resolution for pxj^ 2 — IOOj 50 GeV is about 10 GeV. 
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Figure 3: Dijet invariant mass distributions for pp — )• Wh and /i — )• 2x — ?• 2j (m^ = 
4 GeV) and SM backgrounds from electromagnetic calorimeter-based analysis using CA 
algorithm (left) and KT algorithm (right) for £ = 30 fb~^ at the LHC. 



This cut is proved to be very helpful for Higgs discovery in our study. This is 
because jets in several background processes are coming from hadronic W decay or 
mistagged 6-jet. In the case that both jets are from single W, it is unlikely that 
they are simultaneously hard. The condition of E^{ji 2) > 100,50 GeV will select 
roughly 10% of signal events out of those after imposing pre-selection cuts. While 
only 0.37% and 2.28% of Wjj and ti events, respectively, can pass this cut, which 
are, however, still large enough to dominate over signal. On the contrary, all other 
backgrounds are suppressed to the negligible level. Another advantage of cut is 
that the invariant mass of uncorrelated dijet moves toward higher mass region when 
a harder cut is adopted, but the signal always peaks at the value of the Higgs 
boson mass. 

After applying all cuts altogether, the 
dijet invariant mass distributions for = 
4 GeV signal and backgrounds are shown in 
Figure | for £ = 30 fb"^ at the LHC. The 
CA (KT) algorithm is used in analysis in the 
left (right) plot. The Higgs boson signal can 
be clearly visible above the backgrounds and 
the performances of CA and KT algorithms 



Jet algorithm 


as (fb) 


s/Vb 


CA 
KT 


1.13 
0.97 


7.09 
7.03 



Table 3: Signal cross section and statis- 
tical significance after all cuts in the dijet 
invariant mass window 110 GeV < rrijj < 
130 GeV for £ = 30 fb~^ at the LHC. 



agree well with each other. The signal cross section and statistical significance after 
imposing all cuts in the dijet invariant mass window 110 GeV < mjj < 130 GeV are 
listed in Table |^, which shows that a 7a significance level can be achieved. 
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Figure 4: Non-normalized distributions of electromagnetic calorimeter-based variables, 
-Rem and E'isoi for signal with = 8 GeV. 

4. Jet Substructure and = 8 GeV Case 

When the mass of x becomes larger, the angular splitting between two gluons from 
X decay increases and, on average, the transverse energy spreads into broader region 
inside a defined cone. This can be clearly seen in the i?em and £"180 distributions shown 
in Figure |[ They are even harder and broader than those for Wjj background (see 
lower panels of Figure 0). As a result, we found that the electromagnetic calorimeter- 
based method fails as a strategy for Higgs discovery if pseudoscalar particle is not 
light enough. 

In the following, we take another approach based on jet substructure technique 
to identify the hadronic decays h 2x . 

Jet Substructure 

When reconstructing jets, one has to adopt an algorithm which iteratively merges 
protojets - experimental objects such as calorimeter towers, clusters, or final state 
particles - into jets. Therefore, jet recombination process in jet-finding algorithms, 
like CA and KT in our study, naturally preserves the characteristic substructure 
for a fat jet initiated from a boosted heavy particle decay. Recently, the use of the 
jet internal structure has provided us a new window for reconstruction of a heavy 
particle which hadronically decays into a single jet, such as a boosted Higgs boson 
decay and a boosted top quark decay f^, [^l- In our scenario, since the pseudoscalar 
particle is highly boosted and decays into a pair of massless gluons, we then expect 
that the jet from x decay contains rather symmetric and two hard substructures. 
We follow the prescription in Ref. |^ to find jet substructure as: 

1. Firstly, particles (excluding an isolated lepton) are clustered into jets with a 
radius R. For comparison, we use two clustering algorithms: CA and KT 
algorithms which have hierarchical structure for the clustering in angles and in 
relative transverse momenta, respectively. 
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2. For the two hardest jets, the substructure is searched for by undoing the last 
recombination step, namely breaking each jet j into two subjets, ji and j2 such 
that rrij^ > rrij^. 

3. If there is a significant mass drop, mj^ < firrij and the splitting is not too asym- 
metric, y = — (ptji'Ptj2^ ^j^2 ^ mm(pTj^,pTj^) ^ Considered 

as a pseudoscalar particle neighborhood - including the daughter gluons and 
QCD radiation - and the finding process stops. 

4. Otherwise, replace j by ji. If the ratio of ptj to the original jet pt is larger 
than a parameter 6p, then go back to repeat from step 2; otherwise, the process 
stops. 



For this analysis, we take R = 0.5 which is smaller than a typical value R ~ 
1.2 — 1.5 in other literatures. This is because the pseudoscalar particle x is much 
lighter than its mother particle, the Higgs boson, and is highly boosted, therefore, 
the jet initiated from x decay should be narrow. Other three parameters fi, ycut and 
6p can be chosen independently. It was pointed out in |^ that the decay into a three 
equally energy shared configuration, i.e. ggg configuration, will still trigger the mass 
drop condition if one takes /i > 1/v^. Here, we use /i = 0.67. The cut on y is used to 
eliminate the asymmetric configurations from soft radiations that usually generate 
significant jet mass and the parameter Sp is introduced just to prevent the algorithm 
to go back too many steps. In this work, we take ?/cut = (0.35)^ ~ 0.12 and 6p = 0.2. 

In addition, we also expect that at the scale where two subjets are resolved, 
the KT distance, d = mm[p'^{subji),p'^{subj2)]AR'^^fjj_^^/R'^, will be of the order 
O(m^) p, 1^, where PT{subji 2) are the transverse momenta of the subjets 1 and 2, 
and Ai?^„^j-^2 is the AR^ between subjets 1 and 2. The distributions of \og{\/d) for 
signal and Wjj background are shown in Figure ^. We define the "subjet cut" as 
the requirement that jet substructures can be found and 0.75 < log(v^) < 2.0 is 
satisfied for both of two hardest jets. 

The number of expected signal and background events after imposing cuts in 
the dijet invariant mass window 110 GeV< rrijj < 130 GeV are shown in Table § for 
£ = 30 fb"^ at the LHC. A gain, the first row is the number of events after imposing 
pre-selection cuts, and lower entries are numbers when cuts on number of jets, jet 
mass, and jet transverse momentum are separately applied. 

In this analysis, the largest background is Wjj because of its very large cross 
section. However, ti background is also significant and difficult to eliminate. This 
is because 6-quark from each top decay has an energy close to mh/2 in the top rest 
frame and dijet invariant mass distribution from tt has a scale close to Higgs mass 
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log(d)/2 log(d)/2 

Figure 5: Non-normalized distributions of log{^/d) for signal with = 8 GeV and Wjj 
background. 
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WW 


tt 


tq 


tw 
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13,332 
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Uj < 3 
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393,372 


3,796 
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1,314 


TTij < 12 GeV 


435 


175,254 


2,739 


30,949 


3,728 


3,596 


6,288 


PTn,2 > 100, 50 GeV 


112 


14,927 


75 


31,444 


242 


1,286 


5,490 


imposing all cuts (except 


43 


513 


5 


81 


14 


28 


22 


sub jet cut) 
















after including sub jet cut 


16 


16 





2 












Table 4: Number of expected signal and background events after cuts in the dijet invariant 
mass window 110 GeV< rrijj < 130 GeV for £ = 30 fb^^ at the LHC. The analysis is based 
on KT algorithm. 



scale. By putting more stringent cut nj < 3 ^ , ti background can be reduced by a 
factor of 20. The effect of jet mass and px cuts are similar to the discussion in the 
previous analysis. However, the cut in this analysis is somehow looser so that many 
backgrounds can still survive. 

We show in Table ^ the cross section of signal and statistical significance after 
imposing all cuts. It can be seen that, in the mass window between 110 GeV and 
130 GeV, one can reach a 2.65a and 3.77cr statistical significance using CA and KT 
jet-finding algorithms, respectively, with 30 fb~^ luminosity at the LHC. We also 

■'The number of jets is sensitive to underlying events and effects of higher-order QCD calculations. 
If the number of events which contain more than two soft jets is significant, the stringent cut on 
number of jet, rij < 3, will largely reduce our signal. And also, the number of jets in tt background 
depends sensitively on pT and \ri\. For example, when we impose pt > 20 GeV and \ri\ < 3.5 in jet 
definition, the tt events with rij < 3 will increase by a factor of 3 ^ 4, while the our signal and the 
main Wjj background increase by factors of about 1.2 and 1.3, respectively. 
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Figure 6: Dijet invariant mass distributions for pp — )• Wh and /i — )• 2x — ?• 2j {m^ = 
8 GeV) and SM backgrounds from jet substructure analysis using CA algorithm (left) and 
KT algorithm (right) for £ = 30 fb"^ at the LHC. 



notice that the KT algorithm seems to perform better than the CA algorithm which 
is in contradiction of the conclusion from Ref. 0. However, it should be kept in 
mind that the angular splitting between two gluons from x decay is small and the 
configuration is indeed different from one in Ref. [Q. In addition, the statistics 
is rather low and smearing effect is not taken into account in this analysis, so more 
realistic simulation should be performed before more decisive conclusion can be made. 

Furthermore, the acceptances of the jet 



Jet algorithm 


(Ts (fb) 


s/Vb 


CA 
KT 


0.43 
0.53 


2.65 
3.77 



mass cut rrij < 12 GeV are different for KT 
and CA algorithms. As we can see in the Ta- 
ble ^, about 33% and 20% of our main back- 
ground Wjj and ti events pass the jet mass 
cut, respectively, when using KT algorithm. 
However, we found that the acceptances in- 
crease up to about 40% for Wjj and 30% for 
tt background events when CA algorithm is 
adopted^. 

The dijet invariant mass distributions after including sub jet cut are shown in 
Figure ||. The left and right plots are obtained by using CA and KT algorithms, 
respectively. We can see that the signal peak can be visible above backgrounds in 
both plots and the invariant mass distribution for signal is very sharp since smearing 
is off as mentioned earlier. 



Table 5: Signal cross section and statis- 
tical significance after all cuts in a dijet 
invariant mass window 110 GeV< rrijj < 
130 GeV from jet substructure analysis 
for £ = 30 fb^^ at the LHC. 



5. Discussions and Conclusions 

In this paper, we studied the signature and the discovery potential of an elusive 
^The acceptances of other cuts are similar for KT and CA algorithms. The event numbers of 
signal and total background are 13 and 24, respectively, after imposing all of the cuts using CA 
algorithm. 
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Higgs boson which is produced in association with a W boson at the LHC. The 
interesting phenomenology is that the Higgs boson decays dominantly into a pair 
of hght pseudoscalar particles, X) which then decay into a pair of gluons if < 
2mh. The final state, in general, contains at least four jets corresponding to four 
daughter gluons from the Higgs boson cascade decay. However, due to its small 
mass, pseudoscalar becomes highly boosted; as a result, the two gluons from its 
decay move coUinearly and appear as a single jet in the detector. Therefore, the 
signature of such Higgs boson is then two central narrow non-b-jets. 

We proposed two methods to search for such elusive Higgs boson at the LHC: 
electromagnetic calorimeter-based and jet substructure analyses, depending on the 
mass of the x- We took two benchmark masses, = 4 GeV and = 8 GeV, 
to demonstrate how these two methods work. Electromagnetic calorimeter-based 
variables are introduced for characterization of a very narrow jet from x decay. We 
showed that for — 4 GeV case, most of jet energy is deposited around the jet 
core axis and the imposition of cut on electromagnetic calorimeter-based variables is 
very useful to suppress the background. With 30 fb~^ integrated luminosity, we can 
reach more than 7(J statistical significance at the LHC for a 120 GeV Higgs boson. 
When is larger (but still < 2mh), e.g. = 8 GeV, activities inside a jet of signal 
spread out and distributions of electromagnetic calorimeter-based variables become 
broader and look like those from backgrounds. The cut on these variables is therefore 
not effective enough to dig the Higgs boson signal out of a huge Wjj background. 
We then turned to use a jet substructure method, which recently has gained much 
attention as a powerful tool to search for hadronically decaying heavy particles. We 
demonstrated that the signal significance about 3.77(7 can be obtained with 30 fb~^ 
integrated luminosity at the LHC when KT jet-finding algorithm is used. 

Several comments are in order. We adopt the SM value for the Higgs production 
cross section and 100% branching fraction for h ^ 2x and % — )> 2(? decays, since our 
purpose is to show the feasibility of the discovery for such an elusive Higgs boson 
at the LHC. Our results can be easily rescaled for a specific model. It should be 
also emphasized again that, in our analysis, we specifically forced W boson to decay 
into electron for both signal and background processes. Therefore the total statistics 
should be double when the muon channel is considered as well. Lastly, the model 
of the underlying event (UE) in Herwig is known for the absence of multiple parton 
interaction component. We expect that our results are not crucially affected by its 
correction since we are considering a narrow jet regime in which the perturbative 
contribution is predominant. 
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A. Appendix 

In this Appendix, we show distribution of electromagnetic calorimeter-based vari- 
ables, i?em and Eiso, for h — >■ gg, h ^ bb, h ^ uu and h — >■ t+t~, respectively. 
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Figure 7: Distributions of electromagnetic calorimeter-based variables, i?cm and i^iso, for 
h ^ gg (upper left), h ^ bb (upper right), h ^ uu (lower left) and h t~^t~ (lower 
right). 
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